Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 5780 |
| Missing cells | 111 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.6 MiB |
| Average record size in memory | 660.9 B |
Variable types
| Numeric | 8 |
|---|---|
| DateTime | 2 |
| Categorical | 5 |
| Text | 2 |
Unnamed: 0 is highly overall correlated with color | High correlation |
color is highly overall correlated with Unnamed: 0 and 2 other fields | High correlation |
distance is highly overall correlated with fare and 2 other fields | High correlation |
dropoff_borough is highly overall correlated with color and 1 other fields | High correlation |
fare is highly overall correlated with distance and 2 other fields | High correlation |
log_tip is highly overall correlated with payment and 1 other fields | High correlation |
log_total is highly overall correlated with distance and 2 other fields | High correlation |
payment is highly overall correlated with log_tip and 1 other fields | High correlation |
pickup_borough is highly overall correlated with color and 1 other fields | High correlation |
tip is highly overall correlated with log_tip and 1 other fields | High correlation |
total is highly overall correlated with distance and 2 other fields | High correlation |
tolls is highly imbalanced (97.9%) | Imbalance |
pickup_borough is highly imbalanced (62.8%) | Imbalance |
dropoff_borough is highly imbalanced (59.6%) | Imbalance |
Unnamed: 0 is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
passengers has 85 (1.5%) zeros | Zeros |
tip has 2070 (35.8%) zeros | Zeros |
log_tip has 2070 (35.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-09-29 00:18:29.677755 |
|---|---|
| Analysis finished | 2024-09-29 00:18:51.419587 |
| Duration | 21.74 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 5780 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3207.3581 |
| Minimum | 0 |
|---|---|
| Maximum | 6432 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 219.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 324.95 |
| Q1 | 1590.75 |
| median | 3206.5 |
| Q3 | 4813.25 |
| 95-th percentile | 6108.05 |
| Maximum | 6432 |
| Range | 6432 |
| Interquartile range (IQR) | 3222.5 |
Descriptive statistics
| Standard deviation | 1857.1615 |
|---|---|
| Coefficient of variation (CV) | 0.57903155 |
| Kurtosis | -1.2022528 |
| Mean | 3207.3581 |
| Median Absolute Deviation (MAD) | 1613 |
| Skewness | 0.004130154 |
| Sum | 18538530 |
| Variance | 3449049 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 4290 | 1 | < 0.1% |
| 4288 | 1 | < 0.1% |
| 4287 | 1 | < 0.1% |
| 4285 | 1 | < 0.1% |
| 4284 | 1 | < 0.1% |
| 4283 | 1 | < 0.1% |
| 4282 | 1 | < 0.1% |
| 4281 | 1 | < 0.1% |
| 4279 | 1 | < 0.1% |
| Other values (5770) | 5770 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 6432 | 1 | |
| 6431 | 1 | |
| 6430 | 1 | |
| 6428 | 1 | |
| 6427 | 1 | |
| 6426 | 1 | |
| 6425 | 1 | |
| 6424 | 1 | |
| 6423 | 1 | |
| 6422 | 1 |
pickup
Date
| Distinct | 5767 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 219.4 KiB |
| Minimum | 2019-02-28 23:29:03 |
|---|---|
| Maximum | 2019-03-31 23:15:03 |
dropoff
Date
| Distinct | 5774 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 219.4 KiB |
| Minimum | 2019-02-28 23:32:35 |
|---|---|
| Maximum | 2019-03-31 23:27:12 |
passengers
Real number (ℝ)
ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5401384 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 85 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 219.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.2048877 |
|---|---|
| Coefficient of variation (CV) | 0.78232427 |
| Kurtosis | 4.8353555 |
| Mean | 1.5401384 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.3523074 |
| Sum | 8902 |
| Variance | 1.4517543 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 4207 | |
| 2 | 783 | 13.5% |
| 5 | 254 | 4.4% |
| 3 | 215 | 3.7% |
| 6 | 135 | 2.3% |
| 4 | 101 | 1.7% |
| 0 | 85 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 85 | 1.5% |
| 1 | 4207 | |
| 2 | 783 | 13.5% |
| 3 | 215 | 3.7% |
| 4 | 101 | 1.7% |
| 5 | 254 | 4.4% |
| 6 | 135 | 2.3% |
| Value | Count | Frequency (%) |
| 6 | 135 | 2.3% |
| 5 | 254 | 4.4% |
| 4 | 101 | 1.7% |
| 3 | 215 | 3.7% |
| 2 | 783 | 13.5% |
| 1 | 4207 | |
| 0 | 85 | 1.5% |
distance
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 697 |
|---|---|
| Distinct (%) | 12.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0542405 |
| Minimum | 0 |
|---|---|
| Maximum | 17.1 |
| Zeros | 20 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 219.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 0.93 |
| median | 1.5 |
| Q3 | 2.6 |
| 95-th percentile | 5.65 |
| Maximum | 17.1 |
| Range | 17.1 |
| Interquartile range (IQR) | 1.67 |
Descriptive statistics
| Standard deviation | 1.6830888 |
|---|---|
| Coefficient of variation (CV) | 0.81932414 |
| Kurtosis | 5.6583398 |
| Mean | 2.0542405 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | 2.0504716 |
| Sum | 11873.51 |
| Variance | 2.832788 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.8 | 131 | 2.3% |
| 1 | 118 | 2.0% |
| 0.9 | 116 | 2.0% |
| 0.7 | 108 | 1.9% |
| 1.2 | 107 | 1.9% |
| 1.1 | 106 | 1.8% |
| 0.6 | 104 | 1.8% |
| 1.3 | 97 | 1.7% |
| 1.6 | 89 | 1.5% |
| 1.4 | 83 | 1.4% |
| Other values (687) | 4721 |
| Value | Count | Frequency (%) |
| 0 | 20 | |
| 0.02 | 1 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 4 | 0.1% |
| 0.12 | 2 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.15 | 2 | < 0.1% |
| 0.16 | 2 | < 0.1% |
| 0.17 | 1 | < 0.1% |
| 0.2 | 14 |
| Value | Count | Frequency (%) |
| 17.1 | 1 | |
| 11.93 | 1 | |
| 11.52 | 1 | |
| 11.48 | 1 | |
| 11.2 | 1 | |
| 11.19 | 1 | |
| 11.14 | 1 | |
| 11.05 | 1 | |
| 10.97 | 1 | |
| 10.94 | 1 |
fare
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 116 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.19524 |
| Minimum | 2.5 |
|---|---|
| Maximum | 33.78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 219.4 KiB |
Quantile statistics
| Minimum | 2.5 |
|---|---|
| 5-th percentile | 4.5 |
| Q1 | 6 |
| median | 8.5 |
| Q3 | 12.5 |
| 95-th percentile | 21.405 |
| Maximum | 33.78 |
| Range | 31.28 |
| Interquartile range (IQR) | 6.5 |
Descriptive statistics
| Standard deviation | 5.4080392 |
|---|---|
| Coefficient of variation (CV) | 0.53044743 |
| Kurtosis | 1.9667494 |
| Mean | 10.19524 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.3708082 |
| Sum | 58928.49 |
| Variance | 29.246888 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.5 | 349 | 6.0% |
| 6 | 345 | 6.0% |
| 6.5 | 323 | 5.6% |
| 5 | 313 | 5.4% |
| 5.5 | 312 | 5.4% |
| 7 | 296 | 5.1% |
| 8 | 258 | 4.5% |
| 8.5 | 254 | 4.4% |
| 9.5 | 229 | 4.0% |
| 4.5 | 217 | 3.8% |
| Other values (106) | 2884 |
| Value | Count | Frequency (%) |
| 2.5 | 9 | 0.2% |
| 3 | 21 | 0.4% |
| 3.5 | 91 | 1.6% |
| 4 | 154 | |
| 4.5 | 217 | |
| 5 | 313 | |
| 5.5 | 312 | |
| 6 | 345 | |
| 6.5 | 323 | |
| 7 | 296 |
| Value | Count | Frequency (%) |
| 33.78 | 1 | < 0.1% |
| 33.67 | 1 | < 0.1% |
| 33.65 | 1 | < 0.1% |
| 33.54 | 1 | < 0.1% |
| 33.5 | 3 | |
| 33.27 | 1 | < 0.1% |
| 33.22 | 1 | < 0.1% |
| 33.01 | 1 | < 0.1% |
| 33 | 4 | |
| 32.5 | 2 |
tip
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 305 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5851955 |
| Minimum | 0 |
|---|---|
| Maximum | 6.82 |
| Zeros | 2070 |
| Zeros (%) | 35.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 219.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1.65 |
| Q3 | 2.56 |
| 95-th percentile | 4.35 |
| Maximum | 6.82 |
| Range | 6.82 |
| Interquartile range (IQR) | 2.56 |
Descriptive statistics
| Standard deviation | 1.4900555 |
|---|---|
| Coefficient of variation (CV) | 0.93998215 |
| Kurtosis | -0.34465861 |
| Mean | 1.5851955 |
| Median Absolute Deviation (MAD) | 1.53 |
| Skewness | 0.59670723 |
| Sum | 9162.43 |
| Variance | 2.2202653 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2070 | |
| 1 | 310 | 5.4% |
| 2 | 205 | 3.5% |
| 2.16 | 89 | 1.5% |
| 1.86 | 79 | 1.4% |
| 2.36 | 74 | 1.3% |
| 2.26 | 74 | 1.3% |
| 1.96 | 73 | 1.3% |
| 3 | 71 | 1.2% |
| 1.5 | 71 | 1.2% |
| Other values (295) | 2664 |
| Value | Count | Frequency (%) |
| 0 | 2070 | |
| 0.01 | 6 | 0.1% |
| 0.02 | 2 | < 0.1% |
| 0.08 | 1 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.37 | 1 | < 0.1% |
| 0.39 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6.82 | 1 | < 0.1% |
| 6.8 | 1 | < 0.1% |
| 6.7 | 1 | < 0.1% |
| 6.62 | 1 | < 0.1% |
| 6.58 | 2 | |
| 6.55 | 2 | |
| 6.54 | 1 | < 0.1% |
| 6.51 | 1 | < 0.1% |
| 6.45 | 1 | < 0.1% |
| 6.39 | 3 |
tolls
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 512.9 KiB |
| 0.0 | |
|---|---|
| 5.76 | 18 |
| 5.54 | 1 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0032872 |
| Min length | 3 |
Characters and Unicode
| Total characters | 17359 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 5761 | |
| 5.76 | 18 | 0.3% |
| 5.54 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 5761 | |
| 5.76 | 18 | 0.3% |
| 5.54 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11522 | |
| . | 5780 | |
| 5 | 20 | 0.1% |
| 7 | 18 | 0.1% |
| 6 | 18 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17359 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 11522 | |
| . | 5780 | |
| 5 | 20 | 0.1% |
| 7 | 18 | 0.1% |
| 6 | 18 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17359 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 11522 | |
| . | 5780 | |
| 5 | 20 | 0.1% |
| 7 | 18 | 0.1% |
| 6 | 18 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17359 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 11522 | |
| . | 5780 | |
| 5 | 20 | 0.1% |
| 7 | 18 | 0.1% |
| 6 | 18 | 0.1% |
| 4 | 1 | < 0.1% |
total
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 539 |
|---|---|
| Distinct (%) | 9.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.967836 |
| Minimum | 4.8 |
|---|---|
| Maximum | 34.55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 219.4 KiB |
Quantile statistics
| Minimum | 4.8 |
|---|---|
| 5-th percentile | 7.3 |
| Q1 | 10.56 |
| median | 13.55 |
| Q3 | 18.3 |
| 95-th percentile | 27.36 |
| Maximum | 34.55 |
| Range | 29.75 |
| Interquartile range (IQR) | 7.74 |
Descriptive statistics
| Standard deviation | 6.0366074 |
|---|---|
| Coefficient of variation (CV) | 0.4033053 |
| Kurtosis | 0.66567905 |
| Mean | 14.967836 |
| Median Absolute Deviation (MAD) | 3.59 |
| Skewness | 1.0083505 |
| Sum | 86514.09 |
| Variance | 36.440629 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.3 | 146 | 2.5% |
| 11.3 | 129 | 2.2% |
| 11.8 | 122 | 2.1% |
| 10.3 | 121 | 2.1% |
| 9.8 | 121 | 2.1% |
| 8.8 | 118 | 2.0% |
| 12.3 | 101 | 1.7% |
| 8.3 | 100 | 1.7% |
| 10.8 | 98 | 1.7% |
| 12.8 | 95 | 1.6% |
| Other values (529) | 4629 |
| Value | Count | Frequency (%) |
| 4.8 | 19 | |
| 4.81 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 5.28 | 1 | < 0.1% |
| 5.3 | 17 | |
| 5.38 | 1 | < 0.1% |
| 5.55 | 1 | < 0.1% |
| 5.76 | 1 | < 0.1% |
| 5.8 | 38 | |
| 6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 34.55 | 2 | < 0.1% |
| 34.51 | 1 | < 0.1% |
| 34.3 | 10 | |
| 34.28 | 1 | < 0.1% |
| 34.27 | 1 | < 0.1% |
| 34.26 | 1 | < 0.1% |
| 34.17 | 1 | < 0.1% |
| 34.15 | 1 | < 0.1% |
| 34.12 | 1 | < 0.1% |
| 34.1 | 1 | < 0.1% |
color
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 529.0 KiB |
| yellow | |
|---|---|
| green |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.850173 |
| Min length | 5 |
Characters and Unicode
| Total characters | 33814 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | yellow |
|---|---|
| 2nd row | yellow |
| 3rd row | yellow |
| 4th row | yellow |
| 5th row | yellow |
Common Values
| Value | Count | Frequency (%) |
| yellow | 4914 | |
| green | 866 | 15.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| yellow | 4914 | |
| green | 866 | 15.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 9828 | |
| e | 6646 | |
| y | 4914 | |
| o | 4914 | |
| w | 4914 | |
| g | 866 | 2.6% |
| r | 866 | 2.6% |
| n | 866 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 33814 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 9828 | |
| e | 6646 | |
| y | 4914 | |
| o | 4914 | |
| w | 4914 | |
| g | 866 | 2.6% |
| r | 866 | 2.6% |
| n | 866 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 33814 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 9828 | |
| e | 6646 | |
| y | 4914 | |
| o | 4914 | |
| w | 4914 | |
| g | 866 | 2.6% |
| r | 866 | 2.6% |
| n | 866 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 33814 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 9828 | |
| e | 6646 | |
| y | 4914 | |
| o | 4914 | |
| w | 4914 | |
| g | 866 | 2.6% |
| r | 866 | 2.6% |
| n | 866 | 2.6% |
payment
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 37 |
| Missing (%) | 0.6% |
| Memory size | 546.0 KiB |
| credit card | |
|---|---|
| cash |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 8.9364444 |
| Min length | 4 |
Characters and Unicode
| Total characters | 51322 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | credit card |
|---|---|
| 2nd row | cash |
| 3rd row | credit card |
| 4th row | credit card |
| 5th row | credit card |
Common Values
| Value | Count | Frequency (%) |
| credit card | 4050 | |
| cash | 1693 | |
| (Missing) | 37 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| credit | 4050 | |
| card | 4050 | |
| cash | 1693 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 9793 | |
| r | 8100 | |
| d | 8100 | |
| a | 5743 | |
| e | 4050 | |
| i | 4050 | |
| t | 4050 | |
| 4050 | ||
| s | 1693 | 3.3% |
| h | 1693 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 51322 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| c | 9793 | |
| r | 8100 | |
| d | 8100 | |
| a | 5743 | |
| e | 4050 | |
| i | 4050 | |
| t | 4050 | |
| 4050 | ||
| s | 1693 | 3.3% |
| h | 1693 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 51322 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| c | 9793 | |
| r | 8100 | |
| d | 8100 | |
| a | 5743 | |
| e | 4050 | |
| i | 4050 | |
| t | 4050 | |
| 4050 | ||
| s | 1693 | 3.3% |
| h | 1693 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 51322 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| c | 9793 | |
| r | 8100 | |
| d | 8100 | |
| a | 5743 | |
| e | 4050 | |
| i | 4050 | |
| t | 4050 | |
| 4050 | ||
| s | 1693 | 3.3% |
| h | 1693 | 3.3% |
pickup_zone
Text
| Distinct | 178 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 16 |
| Missing (%) | 0.3% |
| Memory size | 587.2 KiB |
Length
| Max length | 33 |
|---|---|
| Median length | 28 |
| Mean length | 16.275156 |
| Min length | 4 |
Characters and Unicode
| Total characters | 93810 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 32 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Lenox Hill West |
|---|---|
| 2nd row | Upper West Side South |
| 3rd row | Alphabet City |
| 4th row | Midtown East |
| 5th row | Times Sq/Theatre District |
| Value | Count | Frequency (%) |
| east | 1633 | 11.4% |
| west | 1011 | 7.0% |
| south | 773 | 5.4% |
| north | 772 | 5.4% |
| side | 714 | 5.0% |
| midtown | 668 | 4.6% |
| upper | 617 | 4.3% |
| village | 495 | 3.4% |
| sq | 418 | 2.9% |
| hill | 413 | 2.9% |
| Other values (183) | 6856 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 8693 | 9.3% |
| 8606 | 9.2% | |
| e | 7849 | 8.4% |
| i | 6110 | 6.5% |
| a | 5874 | 6.3% |
| r | 5420 | 5.8% |
| o | 5227 | 5.6% |
| n | 5132 | 5.5% |
| s | 4661 | 5.0% |
| l | 4385 | 4.7% |
| Other values (44) | 31853 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 93810 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 8693 | 9.3% |
| 8606 | 9.2% | |
| e | 7849 | 8.4% |
| i | 6110 | 6.5% |
| a | 5874 | 6.3% |
| r | 5420 | 5.8% |
| o | 5227 | 5.6% |
| n | 5132 | 5.5% |
| s | 4661 | 5.0% |
| l | 4385 | 4.7% |
| Other values (44) | 31853 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 93810 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 8693 | 9.3% |
| 8606 | 9.2% | |
| e | 7849 | 8.4% |
| i | 6110 | 6.5% |
| a | 5874 | 6.3% |
| r | 5420 | 5.8% |
| o | 5227 | 5.6% |
| n | 5132 | 5.5% |
| s | 4661 | 5.0% |
| l | 4385 | 4.7% |
| Other values (44) | 31853 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 93810 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 8693 | 9.3% |
| 8606 | 9.2% | |
| e | 7849 | 8.4% |
| i | 6110 | 6.5% |
| a | 5874 | 6.3% |
| r | 5420 | 5.8% |
| o | 5227 | 5.6% |
| n | 5132 | 5.5% |
| s | 4661 | 5.0% |
| l | 4385 | 4.7% |
| Other values (44) | 31853 |
dropoff_zone
Text
| Distinct | 196 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 21 |
| Missing (%) | 0.4% |
| Memory size | 587.3 KiB |
Length
| Max length | 35 |
|---|---|
| Median length | 29 |
| Mean length | 16.331134 |
| Min length | 4 |
Characters and Unicode
| Total characters | 94051 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 24 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | UN/Turtle Bay South |
|---|---|
| 2nd row | Upper West Side South |
| 3rd row | West Village |
| 4th row | Yorkville West |
| 5th row | Midtown East |
| Value | Count | Frequency (%) |
| east | 1543 | 10.7% |
| west | 1015 | 7.0% |
| north | 874 | 6.0% |
| south | 752 | 5.2% |
| side | 751 | 5.2% |
| upper | 658 | 4.6% |
| midtown | 586 | 4.1% |
| hill | 515 | 3.6% |
| village | 468 | 3.2% |
| sq | 308 | 2.1% |
| Other values (204) | 6988 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8699 | 9.2% | |
| t | 8510 | 9.0% |
| e | 7980 | 8.5% |
| i | 5952 | 6.3% |
| a | 5951 | 6.3% |
| r | 5683 | 6.0% |
| o | 5033 | 5.4% |
| n | 4745 | 5.0% |
| s | 4661 | 5.0% |
| l | 4627 | 4.9% |
| Other values (44) | 32210 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 94051 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 8699 | 9.2% | |
| t | 8510 | 9.0% |
| e | 7980 | 8.5% |
| i | 5952 | 6.3% |
| a | 5951 | 6.3% |
| r | 5683 | 6.0% |
| o | 5033 | 5.4% |
| n | 4745 | 5.0% |
| s | 4661 | 5.0% |
| l | 4627 | 4.9% |
| Other values (44) | 32210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 94051 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 8699 | 9.2% | |
| t | 8510 | 9.0% |
| e | 7980 | 8.5% |
| i | 5952 | 6.3% |
| a | 5951 | 6.3% |
| r | 5683 | 6.0% |
| o | 5033 | 5.4% |
| n | 4745 | 5.0% |
| s | 4661 | 5.0% |
| l | 4627 | 4.9% |
| Other values (44) | 32210 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 94051 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 8699 | 9.2% | |
| t | 8510 | 9.0% |
| e | 7980 | 8.5% |
| i | 5952 | 6.3% |
| a | 5951 | 6.3% |
| r | 5683 | 6.0% |
| o | 5033 | 5.4% |
| n | 4745 | 5.0% |
| s | 4661 | 5.0% |
| l | 4627 | 4.9% |
| Other values (44) | 32210 |
pickup_borough
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 16 |
| Missing (%) | 0.3% |
| Memory size | 544.9 KiB |
| Manhattan | |
|---|---|
| Queens | 359 |
| Brooklyn | 329 |
| Bronx | 75 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.704025 |
| Min length | 5 |
Characters and Unicode
| Total characters | 50170 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Manhattan |
|---|---|
| 2nd row | Manhattan |
| 3rd row | Manhattan |
| 4th row | Manhattan |
| 5th row | Manhattan |
Common Values
| Value | Count | Frequency (%) |
| Manhattan | 5001 | |
| Queens | 359 | 6.2% |
| Brooklyn | 329 | 5.7% |
| Bronx | 75 | 1.3% |
| (Missing) | 16 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| manhattan | 5001 | |
| queens | 359 | 6.2% |
| brooklyn | 329 | 5.7% |
| bronx | 75 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 15003 | |
| n | 10765 | |
| t | 10002 | |
| M | 5001 | 10.0% |
| h | 5001 | 10.0% |
| o | 733 | 1.5% |
| e | 718 | 1.4% |
| B | 404 | 0.8% |
| r | 404 | 0.8% |
| Q | 359 | 0.7% |
| Other values (6) | 1780 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 50170 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 15003 | |
| n | 10765 | |
| t | 10002 | |
| M | 5001 | 10.0% |
| h | 5001 | 10.0% |
| o | 733 | 1.5% |
| e | 718 | 1.4% |
| B | 404 | 0.8% |
| r | 404 | 0.8% |
| Q | 359 | 0.7% |
| Other values (6) | 1780 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 50170 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 15003 | |
| n | 10765 | |
| t | 10002 | |
| M | 5001 | 10.0% |
| h | 5001 | 10.0% |
| o | 733 | 1.5% |
| e | 718 | 1.4% |
| B | 404 | 0.8% |
| r | 404 | 0.8% |
| Q | 359 | 0.7% |
| Other values (6) | 1780 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 50170 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 15003 | |
| n | 10765 | |
| t | 10002 | |
| M | 5001 | 10.0% |
| h | 5001 | 10.0% |
| o | 733 | 1.5% |
| e | 718 | 1.4% |
| B | 404 | 0.8% |
| r | 404 | 0.8% |
| Q | 359 | 0.7% |
| Other values (6) | 1780 | 3.5% |
dropoff_borough
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 21 |
| Missing (%) | 0.4% |
| Memory size | 544.7 KiB |
| Manhattan | |
|---|---|
| Brooklyn | 383 |
| Queens | 363 |
| Bronx | 99 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.6756381 |
| Min length | 5 |
Characters and Unicode
| Total characters | 49963 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Manhattan |
|---|---|
| 2nd row | Manhattan |
| 3rd row | Manhattan |
| 4th row | Manhattan |
| 5th row | Manhattan |
Common Values
| Value | Count | Frequency (%) |
| Manhattan | 4914 | |
| Brooklyn | 383 | 6.6% |
| Queens | 363 | 6.3% |
| Bronx | 99 | 1.7% |
| (Missing) | 21 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| manhattan | 4914 | |
| brooklyn | 383 | 6.7% |
| queens | 363 | 6.3% |
| bronx | 99 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 14742 | |
| n | 10673 | |
| t | 9828 | |
| M | 4914 | 9.8% |
| h | 4914 | 9.8% |
| o | 865 | 1.7% |
| e | 726 | 1.5% |
| B | 482 | 1.0% |
| r | 482 | 1.0% |
| k | 383 | 0.8% |
| Other values (6) | 1954 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 49963 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 14742 | |
| n | 10673 | |
| t | 9828 | |
| M | 4914 | 9.8% |
| h | 4914 | 9.8% |
| o | 865 | 1.7% |
| e | 726 | 1.5% |
| B | 482 | 1.0% |
| r | 482 | 1.0% |
| k | 383 | 0.8% |
| Other values (6) | 1954 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 49963 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 14742 | |
| n | 10673 | |
| t | 9828 | |
| M | 4914 | 9.8% |
| h | 4914 | 9.8% |
| o | 865 | 1.7% |
| e | 726 | 1.5% |
| B | 482 | 1.0% |
| r | 482 | 1.0% |
| k | 383 | 0.8% |
| Other values (6) | 1954 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 49963 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 14742 | |
| n | 10673 | |
| t | 9828 | |
| M | 4914 | 9.8% |
| h | 4914 | 9.8% |
| o | 865 | 1.7% |
| e | 726 | 1.5% |
| B | 482 | 1.0% |
| r | 482 | 1.0% |
| k | 383 | 0.8% |
| Other values (6) | 1954 | 3.9% |
log_tip
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 305 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.76578564 |
| Minimum | 0 |
|---|---|
| Maximum | 2.0566846 |
| Zeros | 2070 |
| Zeros (%) | 35.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 219.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.97455964 |
| Q3 | 1.2697605 |
| 95-th percentile | 1.6770966 |
| Maximum | 2.0566846 |
| Range | 2.0566846 |
| Interquartile range (IQR) | 1.2697605 |
Descriptive statistics
| Standard deviation | 0.62691948 |
|---|---|
| Coefficient of variation (CV) | 0.81866184 |
| Kurtosis | -1.4747582 |
| Mean | 0.76578564 |
| Median Absolute Deviation (MAD) | 0.47235934 |
| Skewness | -0.11079472 |
| Sum | 4426.241 |
| Variance | 0.39302804 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2070 | |
| 0.6931471806 | 310 | 5.4% |
| 1.098612289 | 205 | 3.5% |
| 1.150572028 | 89 | 1.5% |
| 1.050821625 | 79 | 1.4% |
| 1.211940974 | 74 | 1.3% |
| 1.181727195 | 74 | 1.3% |
| 1.085189268 | 73 | 1.3% |
| 1.386294361 | 71 | 1.2% |
| 0.9162907319 | 71 | 1.2% |
| Other values (295) | 2664 |
| Value | Count | Frequency (%) |
| 0 | 2070 | |
| 0.009950330853 | 6 | 0.1% |
| 0.0198026273 | 2 | < 0.1% |
| 0.07696104114 | 1 | < 0.1% |
| 0.08617769624 | 1 | < 0.1% |
| 0.0953101798 | 1 | < 0.1% |
| 0.1823215568 | 1 | < 0.1% |
| 0.2231435513 | 1 | < 0.1% |
| 0.3148107398 | 1 | < 0.1% |
| 0.3293037471 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2.056684555 | 1 | < 0.1% |
| 2.054123734 | 1 | < 0.1% |
| 2.041220329 | 1 | < 0.1% |
| 2.03077637 | 1 | < 0.1% |
| 2.0255132 | 2 | |
| 2.021547563 | 2 | |
| 2.020222182 | 1 | < 0.1% |
| 2.016235466 | 1 | < 0.1% |
| 2.008214032 | 1 | < 0.1% |
| 2.000127735 | 3 |
log_total
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 539 |
|---|---|
| Distinct (%) | 9.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7042991 |
| Minimum | 1.7578579 |
|---|---|
| Maximum | 3.5709402 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 219.4 KiB |
Quantile statistics
| Minimum | 1.7578579 |
|---|---|
| 5-th percentile | 2.1162555 |
| Q1 | 2.4475509 |
| median | 2.677591 |
| Q3 | 2.9601051 |
| 95-th percentile | 3.3449797 |
| Maximum | 3.5709402 |
| Range | 1.8130822 |
| Interquartile range (IQR) | 0.51255423 |
Descriptive statistics
| Standard deviation | 0.36131995 |
|---|---|
| Coefficient of variation (CV) | 0.13360947 |
| Kurtosis | -0.34760902 |
| Mean | 2.7042991 |
| Median Absolute Deviation (MAD) | 0.25278827 |
| Skewness | 0.16656486 |
| Sum | 15630.849 |
| Variance | 0.13055211 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.332143895 | 146 | 2.5% |
| 2.509599262 | 129 | 2.2% |
| 2.549445171 | 122 | 2.1% |
| 2.424802726 | 121 | 2.1% |
| 2.379546134 | 121 | 2.1% |
| 2.282382386 | 118 | 2.0% |
| 2.587764035 | 101 | 1.7% |
| 2.2300144 | 100 | 1.7% |
| 2.468099531 | 98 | 1.7% |
| 2.624668592 | 95 | 1.6% |
| Other values (529) | 4629 |
| Value | Count | Frequency (%) |
| 1.757857918 | 19 | |
| 1.759580571 | 1 | < 0.1% |
| 1.791759469 | 2 | < 0.1% |
| 1.83736998 | 1 | < 0.1% |
| 1.840549633 | 17 | |
| 1.853168097 | 1 | < 0.1% |
| 1.87946505 | 1 | < 0.1% |
| 1.91102289 | 1 | < 0.1% |
| 1.916922612 | 38 | |
| 1.945910149 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.570940156 | 2 | < 0.1% |
| 3.569814347 | 1 | < 0.1% |
| 3.563882964 | 10 | |
| 3.563316231 | 1 | < 0.1% |
| 3.563032744 | 1 | < 0.1% |
| 3.562749177 | 1 | < 0.1% |
| 3.560193446 | 1 | < 0.1% |
| 3.559624618 | 1 | < 0.1% |
| 3.558770769 | 1 | < 0.1% |
| 3.55820113 | 1 | < 0.1% |
| Unnamed: 0 | color | distance | dropoff_borough | fare | log_tip | log_total | passengers | payment | pickup_borough | tip | tolls | total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Unnamed: 0 | 1.000 | 0.897 | 0.049 | 0.307 | 0.029 | -0.146 | -0.094 | -0.072 | 0.109 | 0.352 | -0.146 | 0.000 | -0.094 |
| color | 0.897 | 1.000 | 0.116 | 0.598 | 0.117 | 0.278 | 0.297 | 0.144 | 0.126 | 0.686 | 0.278 | 0.033 | 0.286 |
| distance | 0.049 | 0.116 | 1.000 | 0.193 | 0.921 | 0.233 | 0.850 | 0.006 | 0.044 | 0.157 | 0.233 | 0.104 | 0.850 |
| dropoff_borough | 0.307 | 0.598 | 0.193 | 1.000 | 0.162 | 0.167 | 0.153 | 0.047 | 0.149 | 0.757 | 0.168 | 0.047 | 0.151 |
| fare | 0.029 | 0.117 | 0.921 | 0.162 | 1.000 | 0.254 | 0.928 | -0.001 | 0.055 | 0.145 | 0.254 | 0.092 | 0.928 |
| log_tip | -0.146 | 0.278 | 0.233 | 0.167 | 0.254 | 1.000 | 0.497 | 0.030 | 0.869 | 0.168 | 1.000 | 0.043 | 0.497 |
| log_total | -0.094 | 0.297 | 0.850 | 0.153 | 0.928 | 0.497 | 1.000 | 0.025 | 0.323 | 0.130 | 0.497 | 0.134 | 1.000 |
| passengers | -0.072 | 0.144 | 0.006 | 0.047 | -0.001 | 0.030 | 0.025 | 1.000 | 0.034 | 0.052 | 0.030 | 0.000 | 0.025 |
| payment | 0.109 | 0.126 | 0.044 | 0.149 | 0.055 | 0.869 | 0.323 | 0.034 | 1.000 | 0.159 | 0.859 | 0.000 | 0.282 |
| pickup_borough | 0.352 | 0.686 | 0.157 | 0.757 | 0.145 | 0.168 | 0.130 | 0.052 | 0.159 | 1.000 | 0.167 | 0.050 | 0.131 |
| tip | -0.146 | 0.278 | 0.233 | 0.168 | 0.254 | 1.000 | 0.497 | 0.030 | 0.859 | 0.167 | 1.000 | 0.074 | 0.497 |
| tolls | 0.000 | 0.033 | 0.104 | 0.047 | 0.092 | 0.043 | 0.134 | 0.000 | 0.000 | 0.050 | 0.074 | 1.000 | 0.142 |
| total | -0.094 | 0.286 | 0.850 | 0.151 | 0.928 | 0.497 | 1.000 | 0.025 | 0.282 | 0.131 | 0.497 | 0.142 | 1.000 |
| Unnamed: 0 | pickup | dropoff | passengers | distance | fare | tip | tolls | total | color | payment | pickup_zone | dropoff_zone | pickup_borough | dropoff_borough | log_tip | log_total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2019-03-23 20:21:09 | 2019-03-23 20:27:24 | 1 | 1.60 | 7.0 | 2.15 | 0.0 | 12.95 | yellow | credit card | Lenox Hill West | UN/Turtle Bay South | Manhattan | Manhattan | 1.147402 | 2.635480 |
| 1 | 1 | 2019-03-04 16:11:55 | 2019-03-04 16:19:00 | 1 | 0.79 | 5.0 | 0.00 | 0.0 | 9.30 | yellow | cash | Upper West Side South | Upper West Side South | Manhattan | Manhattan | 0.000000 | 2.332144 |
| 2 | 2 | 2019-03-27 17:53:01 | 2019-03-27 18:00:25 | 1 | 1.37 | 7.5 | 2.36 | 0.0 | 14.16 | yellow | credit card | Alphabet City | West Village | Manhattan | Manhattan | 1.211941 | 2.718660 |
| 4 | 4 | 2019-03-30 13:27:42 | 2019-03-30 13:37:14 | 3 | 2.16 | 9.0 | 1.10 | 0.0 | 13.40 | yellow | credit card | Midtown East | Yorkville West | Manhattan | Manhattan | 0.741937 | 2.667228 |
| 5 | 5 | 2019-03-11 10:37:23 | 2019-03-11 10:47:31 | 1 | 0.49 | 7.5 | 2.16 | 0.0 | 12.96 | yellow | credit card | Times Sq/Theatre District | Midtown East | Manhattan | Manhattan | 1.150572 | 2.636196 |
| 6 | 6 | 2019-03-26 21:07:31 | 2019-03-26 21:17:29 | 1 | 3.65 | 13.0 | 2.00 | 0.0 | 18.80 | yellow | credit card | Battery Park City | Two Bridges/Seward Park | Manhattan | Manhattan | 1.098612 | 2.985682 |
| 7 | 7 | 2019-03-22 12:47:13 | 2019-03-22 12:58:17 | 0 | 1.40 | 8.5 | 0.00 | 0.0 | 11.80 | yellow | NaN | Murray Hill | Flatiron | Manhattan | Manhattan | 0.000000 | 2.549445 |
| 8 | 8 | 2019-03-23 11:48:50 | 2019-03-23 12:06:14 | 1 | 3.63 | 15.0 | 1.00 | 0.0 | 19.30 | yellow | credit card | East Harlem South | Midtown Center | Manhattan | Manhattan | 0.693147 | 3.010621 |
| 9 | 9 | 2019-03-08 16:18:37 | 2019-03-08 16:26:57 | 1 | 1.52 | 8.0 | 1.00 | 0.0 | 13.30 | yellow | credit card | Lincoln Square East | Central Park | Manhattan | Manhattan | 0.693147 | 2.660260 |
| 10 | 10 | 2019-03-16 10:02:25 | 2019-03-16 10:22:29 | 1 | 3.90 | 17.0 | 0.00 | 0.0 | 17.80 | yellow | cash | LaGuardia Airport | Astoria | Queens | Queens | 0.000000 | 2.933857 |
| Unnamed: 0 | pickup | dropoff | passengers | distance | fare | tip | tolls | total | color | payment | pickup_zone | dropoff_zone | pickup_borough | dropoff_borough | log_tip | log_total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6422 | 6422 | 2019-03-22 20:17:35 | 2019-03-22 20:36:07 | 1 | 4.02 | 16.00 | 0.00 | 0.0 | 17.30 | green | cash | Washington Heights South | Spuyten Duyvil/Kingsbridge | Manhattan | Bronx | 0.000000 | 2.906901 |
| 6423 | 6423 | 2019-03-12 08:10:47 | 2019-03-12 08:35:35 | 1 | 4.30 | 18.50 | 0.00 | 0.0 | 19.30 | green | credit card | Saint Albans | Hillcrest/Pomonok | Queens | Queens | 0.000000 | 3.010621 |
| 6424 | 6424 | 2019-03-30 20:52:15 | 2019-03-30 20:59:55 | 1 | 1.70 | 8.00 | 0.00 | 0.0 | 9.30 | green | cash | Central Harlem | Central Harlem North | Manhattan | Manhattan | 0.000000 | 2.332144 |
| 6425 | 6425 | 2019-03-07 15:34:30 | 2019-03-07 16:31:06 | 1 | 9.12 | 26.32 | 0.00 | 0.0 | 26.82 | green | credit card | Park Slope | East New York | Brooklyn | Brooklyn | 0.000000 | 3.325755 |
| 6426 | 6426 | 2019-03-28 08:04:47 | 2019-03-28 08:07:46 | 1 | 0.71 | 4.50 | 0.50 | 0.0 | 5.80 | green | credit card | Central Park | Upper West Side North | Manhattan | Manhattan | 0.405465 | 1.916923 |
| 6427 | 6427 | 2019-03-23 18:26:09 | 2019-03-23 18:49:12 | 1 | 7.07 | 20.00 | 0.00 | 0.0 | 20.00 | green | cash | Parkchester | East Harlem South | Bronx | Manhattan | 0.000000 | 3.044522 |
| 6428 | 6428 | 2019-03-31 09:51:53 | 2019-03-31 09:55:27 | 1 | 0.75 | 4.50 | 1.06 | 0.0 | 6.36 | green | credit card | East Harlem North | Central Harlem North | Manhattan | Manhattan | 0.722706 | 1.996060 |
| 6430 | 6430 | 2019-03-23 22:55:18 | 2019-03-23 23:14:25 | 1 | 4.14 | 16.00 | 0.00 | 0.0 | 17.30 | green | cash | Crown Heights North | Bushwick North | Brooklyn | Brooklyn | 0.000000 | 2.906901 |
| 6431 | 6431 | 2019-03-04 10:09:25 | 2019-03-04 10:14:29 | 1 | 1.12 | 6.00 | 0.00 | 0.0 | 6.80 | green | credit card | East New York | East Flatbush/Remsen Village | Brooklyn | Brooklyn | 0.000000 | 2.054124 |
| 6432 | 6432 | 2019-03-13 19:31:22 | 2019-03-13 19:48:02 | 1 | 3.85 | 15.00 | 3.36 | 0.0 | 20.16 | green | credit card | Boerum Hill | Windsor Terrace | Brooklyn | Brooklyn | 1.472472 | 3.052113 |